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[A VLSI NEURAL FUZZY 
CLASSIFIER FOR HANDWRITING 

RECOGNITION] 

Background of Invention 

[0001] Field of the Invention 

[0002] The present invention relates to a handwriting recognition device; and more 

particularly, to a handwritten numeral recognition device using fuzzy logic and cellular 
neural network. 

[0003] Description of Related Art 

[0004] The influx of personal digital assistants (PDA) and touch-screen electrical 

appliances has created a demand for a small, Instantly-recognizable, and robust 
handwritten character translator (HCT). Handwriting recognition devices are devices 
that attempt to identify handwritten images based upon an existing character in order 
to output recognized images. One inherent difficulty of applications related to 
handwritten character recognition is that handwritten characters are variable on an 
individual basis. Moreover characters are usually separated into alphabets, numerals, 
and symbols despite the difference characters of the language Itself. Therefore it is 
advantageous to separate the handwriting recognition into individual systems for 
increasing its speed, robustness, and efficiency. 

[0005] 1^ order to accommodate handwritten recognition devices to an individual's 
handwriting style, typical handwritten recognition devices will firstly operate in 
software using a trained character, whereby a user will place the device in a training 
mode and enter in the user's numeral on a regular basis via a graphic user interface, 
such as a digital writing tablet. The numerals input by the user are then used as a 
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template for use in comparing other numerals that are handwritten by the user via the 
graphical user interface. In such a case, however, the operation of the existing 
handwritten character recognition devices is based on the assumption that an 
individual's writing style is relatively uniform. Thus, if a user's handwriting varies due 
to fatigue or stress, the handwritten characters may be unrecognizable. Thus, to 
ensure recognition, a user may often be required to write slowly and carefully. In 
addition not only do individuals have different handwriting styles, but an individual's 
own handwriting may change over time and moreover the characters may be smaller 
or larger In different instances. Devices attempting to perform handwritten character 
translation may make incorrect decisions due to an inability to accommodate a variety 
of handwritten character shapes and sizes. In addition, these devices would be 
relatively slow due to the software implementation. Finally, the existing handwritten 
character recognition devices suffer the disadvantage of being user-specific, and 
would not work for another user because of the individual variations in handwriting. 

[0006] 



Because of the fuzzy nature of human handwriting, it makes sense to adapt "fuzzy logic" 
into the handwriting recognition device. The "fuzzy logic" control theory is well known in the 
art of data processing. Rather than evaluating the two values "TRUE" and "FALSE" as in digital 
logic, fuzzy terms admit to degrees of membership in multiple sets so that fuzzy rules may 
have a continuous, rather than stepwise, range of truth of possibility. Therefore non-identical 
handwritten numeral from same or different users can be approximated using fuzzy logic for 
fast and robust handwriting recognition. The conventional method of fuzzy recognition relies 
upon the maximum membership principle: 
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/iii(jUo) = max {jLiii(fio), ^2(//o), A, /ft.v(//o) } (1) 
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therefore /loc At , where 



are N standard characters M ^ is the object to be recognized and N is the total number of 
classes. This conventional method is a single factor equation and can only recognize the 
handwritten character as a whole and find the closest standard character for to-be- 
recognized character. Therefore the conventional method is very restricted in terms of speed, 
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robustness, and accuracy. To enhance system performance, it is necessary to find two or 
more of tlie closest standard characters for the to-be-recognized character and the standard 
characters. 

[0007] Generally, analog fuzzy logic is constructed by multi-value logic circuit units, 
which may be of a voltage type or a current type. For conventional voltage type 
circuits, operational amplifiers are required for summation or subtraction operations 
to voltages, which makes the circuit complicated. On the contrary, the current type 
circuit is capable of proceeding summation and subtraction operations to currents and 
thus simplifies the circuit and is used In present invention. In addition, the operating 
speed of a current type circuit is generally higher than that of the voltage type circuit 
because the gain bandwidth of the operational amplifier restricts the operating speed 
of the voltage type circuit. Moreover, in a voltage type fuzzy logic circuit, switch 
capacitors are usually required, which increases the size of a chip for the circuit 
because a large chip area is required to fabricate a capacitor. The use of switch 
capacitors also increases the complexity of manufacturing a chip for the circuit as two 
polysilicon layers are required for fabricating a switch capacitor. The fabrication of a 
current switch for the current type fuzzy logic can be done by standard digital CMOS 
technology and thus reduce the complexity of manufacturing a chip for the circuit. 
Accordingly, the present invention provides a switch current type fuzzy processor for 
high-speed character recognition. 

[0008] Furthermore, a handwritten character is mostly unique even written by the same 
user because it can vary in size, magnitude, curvature, stroke, and etc ... Therefore a 
fuzzy logic handwritten character translator Itself is not sufficiently fast and robust in 
recognizing the entire character. As a result, it is a necessary to include a feature 
extractor that capable of high-speed parallel signal processing. Cellular neural 
network (CNN) is the best choice because it allows real-time signal processing found 
within a digital domain and local interconnection features for VLSI implementation. A 
feature extractor uses a 24x24 pixels CNN to perform extraction of the handwritten 
numeral image. The extracted Image is then decoded into various feature groups 
consisting of different feature, a feature is the primitive of a character such as a dot, 
line, or curve. 
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Summary of Invention 

[0009] It is an object of the present invention to provide a handwriting recognition device 
using CNN and fuzzy logic to recognize numerals despite variations in the character. 
Furthermore, the present invention provides a handwriting recognition device that can 
directly classify the normalized handwritten digit image. 

[0010] It is an object of the present invention to provide a handwriting recognition device 
that have the properties of parallelism, trainability, and robustness. 

[001 1] It is an object of the present invention to have a smart structure that allows 
membership function programmability and output configurability. 

[0012] It is a further object of the present invention to provide a handwritten character 
recognition device capable of identif/ing handwritten numeral regardless or the size 
or scale of the handwritten numeral. 

[0013] It is a further object of the present invention to provide a current-type 

handwriting recognition device that is efficient, low cost, and easy to fabricate in VLSI. 

[001 4] In order to achieve these and other objects, the present invention provides a 

handwriting recognition device namely the classifier comprising two main parts. The 
first extraction unit comprises a 24-bits connected component detector (CCD) feature 
extractor using CNN and a feature compression block; and the second membership 
function generator comprises a 1 0x1 0 membership function generator (IS/IFG) array, a 
10 switched current (SI) integrators, and a 1 1-port k-winner-takes-all (k-WTA) circuit. 

[001 5] According to the present invention, the handwriting recognition device uses fuzzy 
logic and CNN to determine the optimum numeral candidate from a library of 
predetermined sets of character sequences. Specifically, the handwriting recognition 
of the present invention uses a two dimensional synthetic membership function in 
order to classify the extracted features of a scanned numeral image. The synthetic 
membership function represents a degree of an input numeral belonging to an object 
class as a whole in terms of different features. A "feature" is defined as a basic 
component of a handwritten character such as a dot, a curve, a line such that a 
handwritten character is composed of a combination of features or feature groups. 
The weighted sum synthetic membership function has better generalization ability 
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since It can reinforce the contribution of different feature groups to different classes. 
As a result, the weighted sum synthetic membership function is much faster and more 
robust by recognizing the features and feature groups of a scanned image. There are 
many choices for the synthesis functions. The most frequently used are the minimum- 
finding function but it is not suitable for character recognition since it only 
emphasizes a local feature and neglect other features. The summation function is able 
to include the relations between all features whereby it is suitable for pattern 
recognition. Therefore the summation function is adopted in the present invention. 
When the weighted sum synthetic membership function Is combined with the 
maximum membership principle, It can give the final recognizing results. 

[0016] In accordance to the preferred embodiment, the current mode VLSI classifier has a 
I/O circuit for inputting and outputting a plurality of membership functions. An 
extraction unit comprising a CCD extractor with a CNN structure and a compression 
unit receives a to-be-recognized character having a plurality of input features for 
generating a plurality of features values after compression. A membership function 
generator stores the plurality of membership functions and receives the plurality of 
features values to generate a plurality of current-type membership degrees. A 
plurality of switched-current integrators receives the plurality of current-type 
membership degrees for generating a plurality of synthesis membership degrees. A k- 
WTA circuit Is provided for comparing the plurality of synthesis membership degrees 
and output the plurality of synthesis membership degrees as well as the 
corresponding characters in an order of magnitude. The I/O circuit provides the final 
recognizing result. 

[001 7] It is to be understood that both the foregoing general description and the 

following detailed description are exemplary, and are mtended to provide further 
explanation of the invention as claimed. 



Brief Description of Drawings 



[001 8] 



The accompanying drawings are included to provide a further understanding of 
the invention, and are incorporated in and constitute a part of this specification. The 
drawings illustrate embodiments of the invention and, together with the description, 



serve to explain the principles of the invention. In the drawings, 
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[0019] FIG. 1 is a schematic diagram of the structure of the classifier according to the 
preferred embodiment of the present invention. 

[0020] FIG. 2 is a circuit diagram of a cell circuit of the CCD extractor according to the 
preferred embodiment of the present invention. 

[0021] FIG. 3 is a circuit diagram of the feature compression unit according to the 
preferred embodiment of the present invention. 

[0022] FIG. 4 is a circuit diagram of one column of the fuzzy weighted sum of the 

membership function generator according to the preferred embodiment of the present 
invention. 

[0023] FIG. 5 is a circuit diagram of the SI integrator according to the preferred 
embodiment of the present invention. 

[0024] FIG. 6 is a circuit diagram of the k-WTA according to the preferred embodiment of 
the present invention 

Detailed Description 

[0025] 



Please refer to FIG. 1, the structure of the neural-fuzzy classifier of the present invention 
for handwriting recognition is illustrated, which comprises an extraction unit 10, a 
membership function generator 1 9, a clock generator and logic controller 1 3, a column 
decoder 1 4, an I/O circuit 1 8, and k-WTA circuit 1 7, wherein the I/O circuit 1 8, and k-WTA 
circuit 1 7 have 1 1 ports. The extraction unit 1 0 further comprises a CCD extractor 1 1 and a 
compression unit 1 2, wherein the CCD extractor is 24 bits. The membership function 
generator 1 9 further comprises a 10x10 programmable membership function generator (MFC) 
array 1 5, and a SI integrator 1 6, wherein the SI integrator 1 6 has 1 0 switches. In the 
membership function generator 19, the weighted-sum synthetic membership function used is 
shown in equation (2): 
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Where i=l , 2, ... , N is the index of class number, J is the number of feature groups, W Is the 

U 

th th 
weight of the j groups feature belonging to the i class, K Is the length of each group 

feature codes, m is the Input feature code and \a (m ) is the membership of feature 
jk.x Ijk ij 

t h 

code m belonging to the 1 class. For the present invention, N= 1 0, J=8, and K= 1 0 are 
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used in the preferred embodiment. En is an enable signal for enabling MFC array 1 5 via I/O 
circuit 1 8 to control the operation of the classifier. The logic operations of the classifier are 
controlled by a number of non-overlapping clock signals ckl and ck2 by the clock generator 
and logic controller 13. 

[0026] The 24-bit CCD extractor 1 1 is a cellular neural network structure that can be 
easily implemented in CMOS circuitry. The CCD extractor 1 1 scans a normalized 24 
pixels X 24 pixels handwritten black and white image of a numeral in two directions 
and two diagonal lines. The pixels of the normalized numeral consisting of different 
features are black and the rest of the image pixels Is white, however inverted images 
can also adapted. The raw data of the scanned image from the 24-bit CCD extractor 
1 1 is sent to the compression unit 1 2 for compression into smaller data. The 
peripheral edge pixels of the image are always white so the effective image size 
decreases to 22 pixels x 22 pixels. The scanned image therefore contains 80 bits of 
CCD feature codes which includes two added dummy bits. These compressed codes 
will be fed into the 1 0x1 0 pre-programmed MFC array in eight time cycles which are 
controlled by the clock generator and logic controller 1 3. 

[0027] The I/O circuit 18 provides the programming codes of the membership functions 
under the control of the column decoder 1 5. The results jl/{X) is immediately 
outputted to and stored in the 1 0 SI integrator 1 6 until all the features codes are 
transmitted. The maximum membership function is applied to the results of the 
weighted sum synthetic membership function by the k-WTA circuit 1 7 to generate the 
highest matching results. The results are then sent to the I/O circuit 1 8 to give the 
final recognizing results. 

[0028] 

FIG. 2 illustrates a single cell circuit of the CCD of the present invention. The 24- 
bit CCD extractor is constructed by 24 cascaded cells to form the 24x24 pixels feature 
extractor. Each cell circuit comprises 4 cascaded current mirrors connected to the 
output of the cell circuit. There are 4 transmission gates controlling the input of the 
CCD extractor. The input signal of the CCD extractor 1 1 is connected to the input of 
transmission gate 20, transmission gate 21 and 23 through a transistor. The output of 
transmission gate 20 is connected to a node further connected to the input of 
transmission gates 21 and the gate of transmission gate 22, and the gate of 
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transmission gate 20 is connected to a ST signal. The input of transmission gate 21 is 
connected to the input signal, the gate of the transmission gate 21 is connected to the 
output of the transmission gate 20, the other gate of transmission gate 21 Is 
connected to ground, and the output of transmission gate 21 is connected to the 
current mirror 24. The input of transmission gate 22 is connected to the output of 
transmission gate 20, the gate of the transmission gate 22 is connected to a WR 
signal. The input of transmission gate 23 is input signal of the CCD extractor 1 1 , the 
gate of transmission gate 23 is connected to the output of transmission gate 1 2, the 
other gate of transmission gate 1 3 is connected to ground, and the output of 
transmission gate 1 3 is connected to current mirror 1 7 and further to the output of 
the CCD extractor. Current mirrors 14, 15, 16, and 1 7 are connected in cascaded 
manner. HSPICE simulation of this CCD extractor gives good performance. 

[0029] The CCD feature extractor scans a normalized 24x24 pixels handwritten black 
numeral image row by row in horizontal direction and two diagonal lines. The 
peripheral pixel of the image is always white so the effective image size is 22x22 
pixels. There are 4 bits in each feature code but the first bit is discarded because it 
only provides information of normalization of the image, therefore each feature code 
has 3 bits because the scanned images are normalized in the preferred embodiment. 
The 3x22 bit feature codes are thus obtained after scanning all the rows of the image 
and the feature codes are read into a corresponding 4x22 register array in the proper 

th 

order. A diagonal line is obtained by scanning and shifting the i original pixel in the 
th 

i row of the image to the corresponding bit of the last register column. After 
scanning all the rows of the image, the data in the register array is sent back into the 

CCD feature extractor column by column to get 9 bits of CCD features, where the 

diagonal line gives 3 bits results and the other three columns give 6 bits. 

Furthermore, the edge bits in each column are left idle and therefore the data is 

reduced to 20 bits. The 20 bits of the middle of each column are compressed by the 

feature compression unit 12 to 10 bits. 



[0030] 



FIG. 3 shows the logic circuit of the feature compression unit 1 2. The feature 
compression unit 1 2 comprises at least 3 XOR gates connected together to control the 
logic of the templates A, B, and C. XOR gates 31, 32, and 3 3 are coupled together like 
a triplet, wherein XOR gates 31 , 32 share an input Al , and XOR gates 32, 33 share an 
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input Bl . XOR gates 37, 38, and 39 are coupled together like a triplet, wherein XOR 
gates 37, 38 share an input Al , and XOR gates 38, 39 share an input Bl . XOR gates 
34, 35, and 36 are coupled together like a triplet and their outputs are the templates 
A, B, and C respectively. Wherein the input of XOR gate 34 is coupled to both the 
output of XOR gate 33, 37, the input of XOR gate 35 is coupled to both the output of 
XOR gate 32, 38, and the input of XOR gate 36 is coupled to both the output of XOR 
gate 31 , 37. Since both the template B and bias input I of the CCD extractor 1 1 are 
zero, only one row of the template A takes effect. This feature compression unit is 
easily implemented in CMOS circuit. 

[0031] As a result, the feature compression unit 12 outputs 39 bits where 10 bits are 

from each of the 3 columns and 9 bits are from the diagonal line are acquired each in 
the horizontal and vertical directions. A total of 80 bits including 78 scanned bits and 
2 dummy bits are sent to the programmable MFC under control of clock generator and 
logic controller 1 3. 1 0x1 0 bits are sent to the programmable MFC every clock cycle, 
and the 78 bits of scanned data requires 8 clock cycle to complete, wherein 1 dummy 
bit is added every four cycles. 

[0032] ^ shows one column of the 10x10 MFC array 15 with 3 membership function 

memory units shown and the 1 0 SI Integrator 1 6. The circuit unit A 40 is a fully 
symmetrical membership function array in which all the NMOS transistors are of the 
same size. There are 1 0 columns In the 1 0x1 0 MFC array and each column has 1 0 
membership function memory units which store the membership degree 

. . ^ . . . th ^ ^ th 

corresponding to the i feature value of the j feature of the i standard 

character. The membership function memory unit comprises at least 9 transistors, 
wherein 8 transistors are used in pairs in the aspect ratio 0.5:1 :2:4 and controlled by 
control signals DO, Dl, D2, and D3, respectively and one transistor such as MO, Ml, 
and M9 acts as a gate by receiving high and low signals. The MFC array are addressed 
by the input feature signals F0-F9 from the feature compression unit 12 in 
combination with the adjacent but non-overlapping timmg signals CKl and CK2 

thttth 

pulses, each correspondmg to an input feature. tjsadfThe 80 bits scanned 

image is sent to the programmable MFC for feature recognizing. The 1 0 columns of 

feature codes calculate the fuzzy weighted sum core of the membership function 

generation using fuzzy logic from equation (2). An off-chip EPROM unit stores the 
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genetic algorithm off-line learning which will later become control codes for the 

membership function generator. The feature codes are compared by the membership 

functions against the standard characters stored in the EPROM. The membership 

functions are programmable and controlled by digital control signals D0-D3 and the 

value of all the membership function ranges from 0,to 7.5x1 in steps of 0.5x1 

ref ref 

wherein the I ^ is a reference current set at 4 m A. The weight is programmable 
ref U 

and realized by part in dashed box and the value ranges from 0 to 3.75 in steps of 
0.25. A group of 1 0-bit feature codes are sent to the 1 0x1 0 programmable MFC 1 5 
array generating 1 0 weighted part-sum currents In every clock cycle. 

[0033] In FIG. 5, the SI integrator 16 performs an accumulation operation on the current- 
type values of weighted membership degrees to obtain the current-type value of a 
synthesis membership. The SI integrator 1 6 is implemented by a switched-current 
technique which introduces the second generation switched-current storage to 
overcome disadvantages of the first generation switched-current storage. To stabilize 
the drain voltage of a storage transistor, a RGC (Regulated-Gate Cascode) structure is 
adopted to construct a storage unit. Although the fully differential integrator can be 
adopted in this classifying system, the single ended RGC integrator is preferably used 
because of its simplicity. The SI integrator 16 accumulates the input current one by 
one in each cycle. Therefore the 80-bit scanned image requires 8 clock cycles to 
complete the transmission. Timing controls of the switches are controlled by two non- 
overlapped clocks CKl and CK2 shown in FIG. 4. In the CKl clock cycle an input 

current I , is sent to the integrator, and in CK2 clock cycle the output current I , is 
1 low 

generated. The switched-current integrator 16 will accumulate all the weighted part- 
sum currents in every clock cycle and the total weighted sum currents are obtained 
and outputted to the 1 1 -port k-NAHTA 1 7 after 8 cycles. The output of SI integrator 1 6 
is regulated by a current mirror with a 1 :1 ratio with a current source of I connected to 
the output. 



[0034] 



The 1 1 -port k-WTA circuit 1 7 is shown in FIG. 6 which comprises a circuit unit A 
62, a circuit unit B 64, and a circuit unit C 66. The k-WTA circuit is used to compare 
and sort the synthesis membership degree currents corresponding to the N standard 
characters and the to-be-recognized character to sequentially output high voltage 
pulses on the outputs corresponding to the N standard character in order of 
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magnitude of the current as a recognition result. If the classifier is used in parallel or 

cascade to improve system performance, the k~WTA circuit must be expanded in 

order to compare and sort M number of classifiers times N standard characters. 

Therefore the circuit unit A Is easily expandable to a larger network depending on its 

application. All transistors work in strong inversion therefore the WTA network is high 

speed. The circuit unit A 62 comprises three identical input units where i q » • ^ * ^"cl i 

^ designate three input currents representing three synthesis membership degrees 

which are directed and output to the circuit unit C 66. The k-WTA circuit 1 7 receives 

the input currents to find the maximum among them and output three voltage signals 

indicating the maximum one. A number of control logic signals Bl and B2 are sent to 

the circuit unit A for enabling the individual maximum-finding circuit by a "high" 

signal. A voltage V is applied to circuit unit A so a current is flown through the 
b 

differential transistors having the maximum input voltage and produce a drain current 

I . when the drain current of other differential transistors are zero. The circuit unit B is 

an output circuit of the k-WTA network. The clock cycle CKl and CK2 controls the 

current source of the diodes and the final output I of the 1 1 -port k-WTA circuit 1 7 

o 

is passed through a 1 :1 current mirror. The k-WTA selects largest k value of each 
group of feature codes at one time from k classes with a simple set of complementary 
features. If signals Bl and B2 are zero voltage levels, the value of k is 1 so the WTA is 
reconfigured as 1-WTA. When any one of the Bl or B2 is high voltage level, the value 
of k becomes 2 so the WTA becomes 2 -WTA. 

[0035] It will be apparent to those skilled in the art that various modifications and 

variations can be made to the structure of the present invention without departing 
from the scope or spirit of the invention. In view of the foregoing, it is intended that 
the present invention cover modifications and variations of this invention provided 
they fall within the scope of the following claims and their equivalents. 
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